Initial Experiments with Multilingual Extraction of Rhetoric Figures by means of PERL-compatible Regular Expressions

نویسنده

  • Daniel Devatman Hromada
چکیده

A language-independent method of figure-ofspeech extraction is proposed in order to reinforce rhetoric-oriented considerations in natural language processing studies. The method is based upon a translation of a canonical form of repetition-based figures of speech into the language of PERL-compatible regular expressions. Anadiplosis, anaphora, antimetabole figures were translated into the form exploiting the backreference properties of PERL-compatible regular expression while epiphora was translated into a formula exploiting recursive properties of this very concise artificial language. These four figures alone matched more than 7000 strings when applied on dramatic and poetic corpora written in English, French, German and Latin. Possible usages varying from stylometric evaluation of translation quality of poetic works to more complex problem of semi-supervised figure of speech induction are briefly discussed.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

159-2011: Perl Regular Expression in SAS® Macro Programming

In this paper, the Perl regular expression facility that provides a concise and flexible means for matching strings of text is extended to the macro environment using three new macro functions. Consequently, this allows direct pattern matching and replacement in macro variables, facilitating the construction of flexible and customized functions through internally parsing patterned inputs. This ...

متن کامل

A Critical Study of Selected Political Elites' Discourse in English

This study explored how political elites can contribute to power enactment through using language. It started with a theoretical overview of Critical Discourse Analysis (CDA), and then presented a corpus consisting of speeches of eight political elites, namely, Malcolm X, Noam Chomsky, Martin Luther King, Josef Stalin, Vladimir Lenin, Winston Churchill, J.F. Kennedy and Adolph Hitler. This stud...

متن کامل

ECEB: Enhanced Constraint Repetition Block for Regular Expression Matching on FPGA

Recent Network Intrusion Detection Systems (NIDSs) utilize Perl Compatible Regular Expression to describe malicious patterns existing in the content payload of packets more and more efficiently. Several techniques are introduced to optimize the performance or complete the system for full support all of PCRE features in hardware platform, but some issues have just been solved partially. Constrai...

متن کامل

Data Extraction from Internet

-Article data extraction from internet is a way to download and extract the required data automatically from web servers. In this paper, we present a method called the Internet Robot to extract the data directly from a web server by using Perl scripting language with the powerful regular expressions. The regular expressions are widely used in this method to reduce the complexity of the program ...

متن کامل

پیشگامان بلاغت فارسی در شبه‌قاره هندوستان

Undoubtedly, the Indian subcontinent has had an undeniable role in the enrichment and spread of Persian language and literature. It has led to the creation of some lasting works in various areas of literature, especially in Persian rhetoric, works that have been written with a unique creativity and innovation. In this study an attempt has been made to investigate three top rhetoric works in the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011